(19) 



J 



Europaisches Patentamt 
European Patent Office 
Office europeen des brevets 



(11) 



EP 1 431 392 A1 



(12) 



EUROPEAN PATENT APPLICATION 

published in accordance with Art. 158(3) EPC 



(43) Date of publication: 

23.06.2004 Bulletin 2004/26 

(21) Application number: 02796359.4 

(22) Date of filing: 29.08.2002 



(51) Intel/: C12N 15/29, C12N 15/82, 
C12N 5/14 

(86) International application number: 
PCT/JP2002/008739 

(87) International publication number: 

WO 2003/01 8808 (06.03.2003 Gazette 2003/1 0) 



(84) Designated Contracting States: 

AT BE BG CH CY CZ DE DK EE ES Fl FR GB GR 
IE IT Li LU MC NL PT SE SK TR 
Designated Extension States: 
AL LT LV MK RO SI 

(30) Priority: 31.08.2001 JP 2001264156 

(71) Applicant: Riken 

Wako-shl, Saltama 351-0198 (JP) 

(72) Inventors: 

• MATSUI, Minaml, c/o Yokohama Institute, Rlken 
Yokohama-shf, Kanagawa 230-0045 (JP) 

• ICHIKAWA,Takanari, 

c/o Yokohama Institute, Riken 
Yokohama-shl, Kanagawa 230-0045 (JP) 



• NAKAZAWA, Mlkl,c/o Yokohama Institute, Rlken 
Yokohama-shi, Kanagawa 230-0045 (JP) 

• SEKI, Motoaki, c/o Tsukuba Institute, Rlken 
Tsukuba-shi, Ibaraki 305-0074 (JP) 

• FUJtTA, MJkl, c/o Tsukuba Institute, Riken 
Tsukuba-shi, Ibaraki 305-0074 (JP) 

• SHINOZAKI, Kazuo, c/o Tsukuba Institute, Riken 
Tsukuba-shi, Ibaraki 305-0074 (JP) 

(74) Representative: 

Denison, Christopher Marcus et al 

Mewburn Ellis LLP 

York House 

23 Kings way 

London WC2B 6HP (GB) 



(54) PLANT SYSTEM FOR COMPREHENSIVE GENE FUNCTION ANALYSIS WITH THE USE OF 
FULL-LENGTH cDNA 



(57) A gene causative of a phenotype is easily spec- 
ified and thus gene functions are comprehensively an- 
alyzed. A method for analyzing a genefunction compris- 
es: (a) infecting a plant group with a full-length cDNA 
library containing an expression regulatory sequence to 
introduce the cDNA into the plants; (b) selecting a plant 



complying with desired selection requirements from the 
plants having the cDNA introduced thereinto; (c) isolat- 
ing the cDNA from the selected pfant; and (d) re-intro- 
ducing the thus isolated cDNA into a plant of the same 
species as the plant into which the cDNA has been in- 
troduced, to reconfirm a phenotype according to the se- 
lection requirements. 
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Description 

TECHNICAL FIELD 

5 [0001] The present invention provides a plant system for comprehensive gene function analysis with the use of full- 
length cDNAs. 

BACKGROUND ART 

10 [0002] To analyze gene functions, methods for introducing a point mutation or introducing an insertion or deletion 
mutation fnto a gene are conventionally employed. For the Introduction of a point mutation, a method for chemically 
treating the whole genome with a mutation-inducing reagent is common. However, according to this method, one 
nucleotide substitution must be searched for from among nucleotide sequences constituting hundreds of millions of 
genomic DNAs though a mutation can easily be introduced, and therefore identification takes much time. Accordingly, 

15 it may be said that the method Is unsuitable for determining the identification offunctions of tens of thousands of genes 
comprehensively and at high speeds. 

[0003] A gene tagging method is known as a method for introducing a mutation into a gene with high efficiency and 
examining gene function in a short period of time. According to this method, a known gene fragment (a tag) is inserted 
into a genome at random and a gene function at the insertion site is disrupted. For plants, a T-DNA or a transposon 

20 is used as such a gene tag (Krysan, P. J. etal., Plant Cell, 1999. 11(12): pp. 2283-90; Speulman E. etal., Plant Cell, 
1999. 11(10): pp. 1853-66). A gene fragment is randomly inserted into a genome by infection of a plant with Agrobac- 
terium and by crossing with a plant having transposase, in the cases of a T-DNA and a transposon, respectively. Then, 
one or two copies of T-DNA are usually inserted per plant individual (Azpiroz-Leehan, R. et al., Trends Genet, 1997. 
13(4): pp. 1 52-6), and in the case of a certain type of transposon, one copy thereof ts inserted into a genome (Fedoroff, 

25 n. etal,, Bioessays, 1995. 17(4): pp. 291-7). By preparing tens of thousands of such insertion mutation strains, a group 
of plant strains wherein individual gene functions are disrupted can be produced. 

[0004] A plant exhibiting a mutant character of interest is isolated and thereafter the relationship between the mutant 
character and the gene is examined. In such case, genetic information adjacent to the insertion site can be obtained 
by methods such as PCR using the introduced gene fragment as a clue, and thereby gene function can be identified 
30 comprehensively and at high speeds (Krysan, P. J. et al., Plant Cell, 1999. 11 (12): pp. 2283-90; Speulman, E. et al, 
Plant Cell, 1999, 11 (10): pp. 1853-66). 

[0005] As an improved form of this gene tagging method, an activation tagging method is known. This activation 
tagging method can bring about the transcriptional activation of a gene existing adjacent to the genome, into which 
theT-DNA has been inserted, by the use of a transcriptional enhancer sequence incorporated into the T-DNA. In recent 

35 years; this method has been developed as a new analytical method regarding plant gene functions (Walden, R. et al., 
Plant Mol Biol, 1 994. 26(5): pp. 1521-8). Among the features of this activation tagging method, a feature for enabling 
the production of a dominant mutation by a tag is considered to be the most important. In other words, it is possible to 
observe even a phenotype attributable to a gene mutation of a type having an overlapping function with other genes 
(e.g„ gene group constituting a gene family). This feature has never been observed through the production of a con- 

40 ventional gene disrupted type of mutant. 

[0006] However, there is one significant problem when such activation tagging method is used for comprehensive 
analysts of gene functions (analyzing gene functions existing on a genome in a group). That is, a genome region that 
can potentially be transcriptionally activated is extended to approximately 5 kb backward and forward from the insertion 
site since the enhancer sequence is used as an activator inside the tag (Weigel, D« et al., Plant Physiol, 2000. 122(4): 

45 pp. 1003-13). Since two or more genes are present on average in a genome region of 10 kb in a model plant like 
Arabidopsls thaliana, It is difficult to determine which gene is activated by the enhancer. Therefore, in orderto specify 
a causative gene, these genes adjacent to the insertion site are all isolated and transformations thereof are carried 
out again for enforced expression, so that the reproduction of the phenotype is confirmed. Through this confirmation, 
it is essential to examine which gene has functioned to define the phenotype. This means that time on a yearly scale 

so is required on average for analyses from the isolation of a plant exhibiting a trait of Interest to the specification of a 
causative gene. Accordingly, in an attemptto use the activation tagging method forcomprehensive analysis of genomic 
gene functions, the novelty of the obtained gene species or phenotype can be recognized. However, a significant 
contradiction that merits such as rapid specification of a gene and applicability to comprehensive analysis cannot be 
obtained, which can be found in the case of a conventional tagging method which is a gene disrupted type. 

55 [0007] The present invention has an object to provide a next-generation activation tagging system, whereby a caus- 
ative gene of a phenotype can easily be specified and gene functions can be comprehensively analyzed. 
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DISCLOSURE OF THE INVENTION 

[0008] The present inventors have undertaken intensive studies to solve the above problems. As a result, they have 
successfully achieved comprehensive analysis of gene functions by introducing into a plant group Agrobacterium hav- 
s ing a T-DNA vector containing a full-length cDNA and a promoter capable of constitutively or conditionally inducing 
expression in a plant cell; overexpressing the full-length cDNA; and confirming a phenotype of the transformed pfant 
body, thus completing the present invention. 
[0009] Namely, the present invention is as follows. 

10 (1) A method for analyzing a gene function comprising the steps of: 

(a) infecting a plant group with a full-length cDNA library containing an expression regulatory sequence to 
introduce the cDNA into the piants; 

(b) selecting a plant complying with desired selection requirements from the plant group having the cDNA 
is introduced thereinto; 

(c) isolating the cDNA from the selected plant; 

(d) re-introducing the isolated cDNA into a plant of the same species as the plant into which the cDNA has 
been introduced to reconfirm a phenotype according to the selection requirements. 

20 (2) A system for analyzing a gene function comprises: 

(a) means for infecting a plant group with a full-length cDNA library containing an expression regulatory se- 
quence to introduce the cDNA into the plants; 

(b) means forseiecting a plant complying with desired selection requirements from the plant group having the 
25 cDNA introduced thereinto; 

(c) means for isolating the cDNA from the selected plant; 

(d) means for re-introducing the isolated cDNA into a plant of the same species as the plant into which the 
cDNA has been introduced to reconfirm a phenotype according to the selection requirements, 

30 in the above analytical method and system, as the expression regulatory sequence, a constitutive expression 

promoter, an inducible promoter or a combination thereof may be exemplified. Further r a full-length cDNA library 
that is introduced into Agrobacterium can be used. In this case, the cDNA is introduced into a plant by infection of 
Agrobacterium. 

Further, as the selection requirements, morphogenetic mutation and/or stress resistance may be given. As a 
35 stress, exemplified is at least one form of stress selected from the group consisting of oligotrophy stress, drought 

stress, temperature stress (low temperature or high temperature stress), strong light stress, ultraviolet stress, salt 
stress, air pollution stress, agricultural chemical stress, oxidization stress, heavy metal stress, disease damage 
stress, and hormone stress. 

40 (3) A plant containing a gene having a function analyzed by the above analytical method. 

[0010] The above plant includes any plant selected from the group consisting of a plant body, a seed, a callus, and 
a protoplast. 

[0011] The present invention is a method for analyzing gene functions of plants as a group, without conducting gene 
45 analysis of each plant, by introducing a full-length cDNA into a plant group, and confirming a phenotype of the plant 
group. The present inventors named this method the "Fox Hunting System" (Full-length cDNA over-expression gene 
hunting system). This method will be described hereinafter (Fig. 1). 

(1) Means for preparing a full-length cDNA library (Fig. 1 A) 

so 

[0012] A full-length cDNA used for the present invention is defined as a complete copy of mRNA. Even whenacDNA 
longer than the obtained cDNA is present, such cDNA is included in the full-length of cDNAof the present invention. 
[0013] A full-length cDNA library includes information on all the amino acids indispensable when a gene functions, 
and therefore all the functions originally owned by the genes to be introduced can be exhibited. Accordingly, in com- 
55 parison with conventional cDNA libraries, the full-length cDNA library has higher efficiency in terms of functional ex- 
pression. Further, since all the cDNA fragments have original information on an initiation codon and a stop codon, 
protein fusion for expression is not necessary and the efficiency for protein expression is high. 
[0014] Such a full-length cDNA can efficiently be prepared from an mRNA of interest by methods known to a person 
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skilled in the art. At the 5' terminus of mRNA, 7-methylguanosine is added through a 5\S' triphosphate linkage. By 
focusing attention on this point, several technologies for full-length cDNA synthesis have been developed. Methods 
usable for the present invention include, but are not limited to, the Cap-trapper method (Carninci, P., et al., Genomics, 
1996. 37(3): pp. 327-36) and the Cap-finder method (Zhao, Z., et al., J. BiotechnoL, 1999. 73(1): pp. 35-41). 
s [0015] As a vector for cloning a full-length cDNA, it is preferable to use a vector such as Sfil, which recognizes eight 
or more nucleotides and has restriction enzyme sites at both sides of the cDNA insertion site for defining the direction 
of inserted DNA to one way. 

[0016] A full-length cDNA library carrying the above full-length cDNA can be prepared by using methods known to 
a person skilled in the art. Examples of vectors for preparing the library include pTAS and pBIG2113SF. 

10 [0017] Further, the full-length cDNA library used for the method of the present invention can be prepared by using 
mRNAs derived from various organisms. For example, cDNA may be synthesized from mRNAs of yeasts, plant path- 
ogens, or the like, in addition to useful plants such as Arabidopsis thaliana, tomatoes, rice plants, and corn. 
[0018] Plants are higher organisms from which transformant clones are very easily produced, Transforrnants ob- 
tained therefrom are suitable for the production of seeds as resource. Therefore, it is preferable to prepare a full-length 

is cDNA library from a plant. 

[0019] Further advantageously, even if a plant is infected with a library including hundreds of million of clones, only 
one or two clones are introduced into one plant and thus the transformed plants are introduced with different clones. 
[0020] On the other hand, a higher animal cell is intracellular^ partitioned by various membrane structures in the 
same manner as a higher plant cell, and similarities therebetween have been found to a surprising degree in terms of 

20 mechanisms for maintenance of such structures or mechanisms for intracellular signal transduction. Therefore, a full- 
length cDNA library can be prepared from mRNAs derived from animals. This means that it is possible to identify 
functions of animal genes that are common with a higher plant by introducing a full-length cDNA derived from a higher 
animal into a higher plant. It is confirmed that many genes of Arabidopsis thaliana exhibit high homology with human 
genes. For example, it has been known that genes similar to 1 69 genes of 289 genes which are causative genes for 

25 human genetic diseases that have been identified so far are present on the genome of Arabidopsis thatiana, and these 
genes have high homology (Tabata, Kagaku-to-seibutu (Chemistry and Biology), Vol., 39, No. 6, 2001). 
[0021] Since all the mRNA molecules are substituted with cDNA molecules at an equivalent quantitative ratio in a 
conventional cDNA library, groups of structural protein genes having high level of expression account for large part of 
molecules in the library. In contrast, groups of genes such as those associated with signal transduction often account 

so for an extremely small portion in the library since those gene groups usually have low expression levels. Thus, the 
abundance ratios of individual cDNA clones are largely varied depending on the gene expression level. It is then 
preferable to prepare a library so that all the clones are contained therein at equivalent ratios regardless of the gene 
expression level. Preparing such library is referred to as "normalization." 

[0022] When an equivalent amount of each clone from this full-length cDNA is mixed, a normalized full-length cDNA 
35 mixture can be obtained. The 5 l and 3' terminal sequences of the synthesized full-length cDNAs are determined, and 
thereby full-length cDNAclones having no overlapping (there are no common sequences in a partial region at termini) 
where terminals do not have a common sequence in a partial region thereof) are selected. Based on the results, a 
database is made. 

[0023] The normalized full-length cDNA library is constructed by selecting cDNAs different from each other and 
40 mixing them in equivalent amounts. Thus, it is uniform on the whole in terms of molecular species although conventional 
cDNA libraries have nonuniformity. Accordingly, when a multicopy gene group of a genomic gene is taken into account, 
the functions of different genes can be examined more fairly, that is, with higher efficiency, than in a case where tagging 
of genome is carried out. 

[0024] Further, normalized full-length cDNAs corresponding to 50% or more of the entire genome are provided for 
45 Arabidopsis thatiana. With respectto human genes, normalized full-length cDNAs corresponding to approximately 80% 
or more of the entire genome are present. A resource of the same type as above is being prepared for rice, a useful 
plant. Accordingly, these provided normalized full-length cDNAs can also be used for the present invention, 
[0025] However, the normalization of a full-length cDNA library is not always necessary for the method of the present 
invention. Whether or not the normalization is carried out can be properly determined by a person skilled in the art 
so based on various factors and cost, such as the extent to which the genomic information of a target organism is known 
or the expected expression level of a target gene. For example, when a person wishes to analyze the function of a 
structural protein, a conventional full-length cDN A library is sufficient for analysis since a gene for that protein is con- 
sidered to be highly expressed. 

55 (2) Means for cloning a full-length cDNA into an expression vector (Fig. 1 B) 

[0026] The obtained full-length cDNA or normalized full-length cDNA is cloned into aT-DNA expression vector used 
for plant transformation with Agrobacterium tumefaciens. T-DNA is a specific region of aTl plasmid, which Is found in 



4 



EP 1 431 392 A1 



a pathogenic strain of Agrobacterium, pathogenic bacteria of crown gall, which is a form of dicotyledon tumor. When 
a plant is infected with these bacteria, T-DNAs are transferred into plant cells to be incorporated into a genomic DNA. 
[0027] This T-DNA contains a sequence for regulating the expression of the full-length cDNA therein. As an expres- 
sion regulating sequence, a cassette wherein a terminator is figated to a promoter sequence that constitutiveiy or 
5 conditionally induces expression In a plant cell Is preferably incorporated. Preferable constitutive expression promoter 
sequence include a 35S promoter sequence of Cauliflower Mosaic Virus (Sanders, P. R. et aL, Nucleic Acids Res, 
1987. 15(4): pp. 1543-58). Examples of inducible promoters include glucocorticoid inducible promoter sequence (Aoya- 
ma, T. et aL, Plant J, 1997. 11(3): pp. 605-12), estrogen inducible promoter sequence (Zuo, J, et al„ Plant J, 2000, 24 

(2) : pp. 265-273). According to the present invention, these promoters may arbitrarily be combined (ligated) with each 
10 other for use. Combinations may be made between constitutive expression promoters, between inducible promoters, 

or between both constitutive expression and inducible promoters, 

[0028] The aforementioned full-length cDNA or normalized full-length cDNA is inserted into the downstream of the 
promoter sequence in the sense or antisense direction through enzymatic reaction. When a sense chain is expressed 
by this insertion, changes of a phenotype caused by overexpression of a gene encoding the inserted cDNA can be 
15 observed. When an antisense chain is expressed, changes of the phenotype caused by underexpression of a gene 
encoding the inserted cDNA can be observed. 

(3) Means for transducing a full-length cDNA library into a plant (Fig. 1 C) 

20 [0029] Next, a group of T-DNAs (a Full-length cDNA over-expressor library; FOX library) wherein this full-length 
cDNA has been inserted is introduced into Agrobacterium by a conventional method to produce a library. Thereafter, 
cDNAs in the library are introduced (for transformation) into a plant through infection with Agrobacterium. 
[0030] Plants to be infected with Agrobacterium may be either dicotyledon or monocotyledon. However, when mono- 
cotyledon is used, addition of a phenolic compound (acetosyringone) to a medium is preferable for efficient infection 

25 with Agrobacteriu m . 

[0031] Plants as described above mean all of entire plant bodies, plant organs (e.g., leaves, flower petals, stems, 
roots, and seeds), plant tissues (e.g., epidermis, phloems, parenchymas, xylems, and vascular bundles), or cultured 
plant cells (including protoplasts and calluses). 

[0032] Plants to be used for transformation include, but are not limited to, those belonging to Brassicaceae, Gramine- 
30 ae, Solanaceae, and Leguminosae (see below). 

Brassicaceae: Arab/dopsis thaliana 
Solanaceae: Nicotiana tabacum 
Gramineae: Zea mays, Oryza sativa 
35 Leguminosae: Glycine max 

[0033] A dipping method may be employed for Infection of a plant with Agrobacterium. In the case of using a dipping 
method, a bundle of plant bodies are dipped in a liquid containing Agrobacterium for 30 to 60 seconds. If necessary, 
cells prepared by tissue culture may be co-cultured. 
40 [0034] Individual Agrobacterium in the library have vectors (T-DNA) into which different cDNAs have been inserted, 
and one or two copies of T-DNA are usually inserted per single plant individual. Therefore, it is said that a group of 
these transformed plants are a group of plant clones, all of which can strongly express one to two different full-length 
cDNAs. Among the group of these plant individuals, amutantof interest is selected by several methods described below. 

45 (4) Selection means by phenotype (Fig. 1 D) 

[0035] While a group of plants having overexpressive cDNAs introduced thereinto is subjected to selection at T1 
generation in accordance with antibiotic resistance, various kinds of stress resistance are additionally used to select 
a surviving (adaptable) mutant strain alone (referred to as "tailor made screening"). 
so [0036] Further, a method can be employed wherein a callus prepared by tissue culture of a wild type plant Is infected 
with the above Agrobacteria FOX library by a co-culture method. When the co-culture method using tissue culture is 
used, selection based on transformation and phenotype can be carried out at the same time. Thus, It is possible to 
use as a transformation material a plant that has poor transformation efficiency by dipping, 

[0037] According to the above selection method, a mutant of interest can easily be obtained by conducting a search 
55 regarding functions of the entirety of the genes of the library as a group. This does not require the preparation of a full 
set of plant group containing the full-length cDNAatonetime, which is referred to as "lining up". Therefore, it is possible 
with slight labor to screen genes capable of imparting specific properties (it does not matter whether functions are 
originally owned by that gene). 
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[0038] Furthermore, a plant is transformed using the Agrobacterium FOX library and seeds are harvested from the 
transformed plant. Thereafter, the entirety of the transformants that have become antibiotic-resistant are grown and 
seeds from individual plants are collected. The group of the thus collected seeds (seed library) can be used as material 
to conduct screening, which uses various phenotypes as indexes. 
5 [0039] Plants of interest by phenotype can be isolated by various selection conditions such as resistances to various 
forms of stress, morphogenetic mutations, environment-responsive mutations, and secondary metabolite mutations. 
In this case, plants include plant bodies, seeds, and calluses (Fig. 1 E). 

[0040] Examples of forms of stress include oligotrophy stress, drought stress, temperature stress (low temperature 
or high temperature stress), strong light stress, ultraviolet stress, salt stress, air pollution stress, agricultural chemical 
10 stress, oxidation stress, heavy metal stress, disease damage stress, and hormone stress. These stresses may be 
applied either alone or in combination. 

[0041] "Oligotrophy stress" means a form of stress caused by the lack of at least one of nitrogen, phosphoric acid, 
or potassium as major soil nutrient components, or by a decrease thereof to 50% or less of their usually required 
amounts. 

15 [0042] "Drought stress" means a form of stress resulting when a continuous or temporary impact is given by the 
depletion of the water. 

[0043] "Temperature stress" means a form of stress resulting when a plant is placed at a temperature higher or lower 
than the optimal temperature for plant growth. For example, "high temperature stress" is defined as a form of stress 
caused by a continuous or temporary impact of conditions where the temperature is 42°C or higherfor several minutes 
or longer. "Low temperature stress" is defined as a form of stress caused by a continuous or temporary impact of 
conditions where the temperature is -4°C or lower for several minutes or longer. 

[0044] "Strong light stress" is defined as a form of stress resulting when a plant is irradiated with a strong light beyond 
its photosynthesis ability, and it is a case, for example, when a light with 1 000 to 2000 p,mol/s/m 2 or more is radiated. 
[0045] "Ultraviolet stress 1 ' means a form of stress resulting when a plant is placed under a state wherein ultraviolet 
25 irradiation (1 to 10 mJ/cm 2 min. or more) with a wavelength of 1 00 to 400 nm is carried out. 

[0046] "Salt stress" means a form of stress caused when physiological functions of a plant are damaged as in a case 
wherein a plant cannot absorb water since the water potential decreases due to salts accumulated in soil. Examples 
of this form of stress include a form of stress caused by salt damage that is brought about through irrigation in a dry 
region. 

so [0047] "Air pollution stress" means a form of stress given by a continuous or temporary impact of air pollutants (ozone, 
sulfur dioxide, COX, etc.). 

[0048] "Agricultural chemical stress" means a form of stress resulting when a plant comes into continuous or tem- 
porary contact with agricultural chemicals. 

[0049] "Oxidation stress" means a form of stress caused by active oxygen. 
35 [0050] "Heavy metal stress" means growth inhibition caused by the increase of concentrations of heavy metals in 
soil, such as aluminum, copper, zinc, nickel, manganese, or cadmium. 

[0051] "Disease damage stress" means a form of stress caused when a plant suffers damage due to viruses, fungi, 
or insects, and examples thereof include blast disease, powdery mildew, brown leaf rust, Granville wilt, mosaic disease, 
and root rot. 

40 [0052] "Hormone stress" means a form of stress caused when various plant hormones or environmental endocrine 
disruptors induce morphological abnormalities or metabolism abnormalities. 

[0053] These stress-resistant plants are created by selecting plants that do not die but exhibit resistance to sufficiently 
stressful conditions for plants (e.g., no watering for one week or cutturing at 37°C or higher). 

[0054] Further, "morphological mutation" means a mutation that induces morphogenesis different from that of a wild 
45 type of plant under its normal environment. Examples of plants wherein morphological mutation has occurred include 
plants with turions, dwarfed plants, plants having upsized leaves, and plants having well-running roots. 
[0055] "Environment-responsive mutation" means a mutation occurring in response to any environmental signal usu- 
ally detected by plants such as temperature, light or gravity. Examples of plants wherein environment-responsive mu- 
tation has occurred include plants that produce easy-to-germinate or difficulMo-germinate seeds, plants that can or 
50 cannot initiate the formation of flower bud immediately, and plants that bend or do not bend toward as following the 
direction of light or gravity, 

[0056] "Secondary metabolite mutation" means a mutation wherein the amount of the entirety of secondary metab- 
olites or a specific secondary metabolite produced by plants increases or decreases. In this connection, raw materials 
of many Chinese herbal medicines are derived from particular plants having such mutations. 
55 [0057] In addition, mechanisms may exist whereby a plant exhibits a specific phenotype by the introduction of a full- 
length cDN A. When a sense mRNA is expressed, it is considered that a larger amount of normal protein than usual or 
normal protein in a tissue wherein the protein is not usually produced is produced because of its strong expression, 
thereby resulting in the appearance of a specific phenotype. It is further considered that the silencing effect of the 
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sense mRNA decreases the amount of normal protein, thereby resulting in the appearance of a specific phenotype. 
In contrast, in the case of an antisense mRNA, it is thought that decreased expression of normal protein results in the 
appearance of a specific phenotype. An important point herein is that the specific phenotype appears as a dominant 
or semidominant phenotype regardless of mechanisms according to the system of the present invention. Therefore, 
5 changes of phenotype are considered attributable to the introduced full-length cDNA. 

(5) Means for reconfirming a phenotype and identifying a gene causative of a mutant character (phenotype) (Figs. 1 F 
toH) 

w [0058] Next, a genomic DNA is extracted from the isolated transformed plant, and primers are designed from this 
DNA, based on information on a nucleotide sequence adjacent to a promoter sequence and terminator sequence 
contained in a T-DNA. Using these primers, PCR is carried out and a cDNA located between these transcriptional 
control regions is isolated (Fig. 1F). This cDNA is inserted into a T-DNA having the same promoter and terminator 
sequences as above, and the T-DNA is re-introduced into a normal plant of the same species as the transformed plant 

is that had been isolated previously (Fig. 1G). The phenotype of stress resistance is thereby reconfirmed (Fig. 1H).Then, 
the sequencing of the cDNA enables the identification of a gene causative of a mutant character. 
[0059] In the introduction of the full-length cDNA into a ptant with Agrobacterium, the number of introduced copies 
of the full-length cDNA is one or two. Therefore, it Is sufficient that a process for isolating the cDNA and confirming the 
phenotype Is repeated at most twice, so that fabor-saving and efficiency improvement can be achieved. 

20 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0060] 

25 Fig, 1 is a view illustrating the outiine of the method of analyzing gene functions using a full-length cDNA according 

to the present invention. 

Fig. 2 is a photograph showing a T1 plant body of the F03024 line. 
Fig. 3 is a photograph showing a T2 plant body of the F03024 line. 

Fig. 4 is a photograph showing a T1 plant body produced by re-tntroducing F03024 using pB!G03024. 

30 

BEST MODES FOR CARRYING OUT THE PRESENT INVENTION 

[0061] The present invention will herein after be described in detail with reference to Examples. However, the technical 
scope of the present invention is not limited to these Examples. 

35 

[Example 1] Gene function analysis 

[0062] in this example, cDNAs that induce mainly visible phenotypes such as morphological abnormalities were 
screened using pBIG2113SF prepared by introducing an Sfil cloning site into a constitutive expression vector 
40 P BIG2113N (Taji, T. et al., Plant J, 2002, 24(4): pp. 417-426, and Becker, D, et al. Nucleic Acid Res., 1990, 18(1): pp. 
203). 

(1) Preparation of normalized full-length cDNA mix 

45 [0063] Full-length cDNAs were prepared from Arabidopsis by the CAPtrapper method. The cDNA was cloned at site 
flanked by Sfjl restriction enzyme sites of Lambda ZAP or Lambda pLC-1-B (Seki M, et al. Plant J., 15, pp. 707-720 
(1 998)). Using a vector sequence, 5'- and 3'- termini of the cDNA were determined, grouping of the cDNAs was carried 
out, and 13,000 types of independent clones were identified (Seki M. et al. Plant Physiol. Biochem. 39, pp. 211-220 
(2001)). Next, 0,5ji of each ctone that was prepared to 50 ng/jil was collected and all of them were mixed into a single 

so test tube. Then, 1 \i\ of the mixture solution was collected and 20 \i\ of Electric (Electroporation) competent cells DH10B 
(Gibco BRL) were transformed. Approximately 200,000 independent colonies grown on agar media containing Amp 
were mixed, and plasmids were collected therefrom. The obtained product is referred to as a normalized full-length 
cDNA mix. 

ss (2) Preparation of FOX Agrobacteria library 

[0064] Two pig of the normalized full-length cDNA mix and 700 jxg of pBIG211 3SF were mixed and cleaved with Sfil 
at the same time. After the cleavage the obtained product was concentrated by isopropanol precipitation. Then, the 
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concentrated product was dissolved ]n 8 jil of water and mixed with 1 \i\ of10x buffer and 1 uj of T4 ligase for reaction 
for 24 hours at 16°C. Thereafter, 2 pj of the reaction solution was mixed with 40 uj of Electric competent cells DH1 OB 
to be transformed. 

[0065] Approximately 150,000 independent colonies grown on agar media containing Kanamycin (Km) were mixed 
s and plasmids were collected therefrom. Then, 2 uJ of the collected plasmid solution was mixed into 40 jil of Electric 
competent Agrobacterium cells GV3101 to be transformed. Approximately 150,000 independent colonies grown on 
agar media containing Km were suspended in an LB liquid medium and glycerol was added so as to result in a con- 
centration of 1 5%. The resultant product was stored at - 80°C. This glycerol solution is referred to as a FOX Agrobacteria 
library, 

w 

(3) Preparation of FOX line 

[0066] Using the above FOX Agrobacteria library, approximately 200,000 colonies were grown and suspended in a 
dipping solution. Then, wild-type Arabidopsis (ecotype: Colombia) was dipped in the solution. The seeds from the 
is plants were harvested, and germinated in an oligotrophic medium BAM containing hygromycin. Approximately 800 
lines of plants exhibiting hygromycin resistance were selected and transplanted in soil. 

(4) Phenotype screening 

20 [0067] Among approximately 800 lines, approximately 90 lines were selected having clear morphological differences 
from wild types or pigment abnormalities, which could be observed by the naked eye. Representative phenotypes 
included dwarfing or enlargement of plant bodies, pigment abnormalities of plant bodies, branching abnormalities, 
morphological abnormalities of leaves and inflorescences, and fertile abnormalities. 

25 (5) Recloning of cDNA 

[0068] Among 90 lines expressing these phenotypes, approximately two rosette leaves (approx. 200 mgfw) were 
collected from each of 47 lines, and genomic DNAs were extracted therefrom. PCR on these DNAs was carried out. 
A PCR reaction solution having the composition below was used and the reaction was carried out through 40 cycles 
30 of PCR, each cycle consisting of 30 seconds at 94°C, 30 seconds at 58°C, and 3.5 minutes at 68*C. 



Composition of reaction solution 


Primers (100 pM) 


2X0.5 jjlI 


dNTP (200 u.l) 


5uJ 


Buffer (x 10) 


5u.l 


Polymerase 


1 [Xl 


Genomic DNA 


10 


Distilled water 


28 uj 


Total 


50 jxl 



[0069] The primers for PCR were as follows. 

GS4: ACATTCTACAACTACATCTAGAGG (SEQ ID NO: 1) 



GS6: CGGCCGCCCCGGGGAT (SEQ ID NO: 2) 

50 

[0070] The PCR products were collected from agarose gel, mixed with pBIG2113SF, and then completely cleaved 
with Sfil. Then, the products were precipitated with isopropanol and treated with T4 ligase. With the obtained product, 
E cofi was transformed. Plasmids having PCR fragments inserted thereinto were selected, and the nucleotide se- 
quences of inserted cDNA fragments were identified using GS4 or GS6. 

55 

(6) Full-length cDNA inserted into FOX line 

[0071] From 40 lines in which cloning was successfully carried out, 43 types of full-length cDNA sequences were 
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found. As shown in Table 1 , 42 types of them each had different sequences and were found to be inserted directly 
downstream of promoters of vectors. 

Table 1 



5 


line 


fragment number 


mips code 


annotation 




F00521 




At5g27150 


NHE1 Na+/H+ exchanger 




F00602 




At1g08460 


hypothetical protein 


10 


F00718 




At2g20880 


AP2 domain transcription factor 




F00721 




At3g16400 


putative lectin 




F00732 




At5g50700 


1 1 -beta-hydroxysteroid dehydrogenase-like 




F00830 




At5g54270 


Lhcb3 chlorophyll a/b binding protein 


15 


F00935 




At3g28670 


unknown protein 




F01022 




At1gl5820 


hypothetical protein 




F01027 




At4g27170 


NWMU4 - 2S albumin 4 precursor 


20 


F01049 




At4gl3930 


hydroxymethyltransferase 




F01205 




At3g43810 


calmodulin 7 




F01305 




At1gB2500 


putative proline-rich cell wall protein 


25 


F01310 




At3g25920 


SOS ribosornal protein L15, chloroplast precursor 


F01317 


1 


At3g60830 


Actin like protein 




F01325 






S-adenosylmethionine decarboxylase 




F01407 




At5g63850 


amino acid transporter AAP4 


30 


F01408 




At5g59090 


cucumlsin precursor - like 




F01410 




At4g35260 


NAD+ dependent isocitrate dehydrogenase subunit 1 




F02304 




At2g45560 


putative cytochrome P450 


35 


F02318 




At1 g76180 


hypothetical protein(dehydrin like) 


F02347 




At1g66820 


hypothetical protein 




F0251G 




At2g30590 


unknown protein 




F02607 


2 


At2g32210 


unknown protein 


40 


F02623 


2 


At5g54l80 


putative protein 




F02635 


2 


At3g01160 


hypothetical protein 




F02742 




At4g00l00 


putative ribosornal protein S13 


45 


F02813 




At2g46540 


expressed protein 


F03039 




At5g43560 


unknown protein 




F0304B 


3 


At2g462B0 


Eukaryotic translation initiation factor 3 delta subunit 




F03124 




At2g43100 


3-isopropylmalate dehydratase, small subunit 


50 


F03142 


2 


At3g19516 


hypothetical protein 




F03205 




At1g23060 


unknown protein 




F03208 




At2g36970 


putative glucosyltransferase 


55 


F03209 




At5g07990 


flavonoid 3'-hydroxylase - like protein 


F03213 




At3g10020 


unknown protein 




F03215 


1 


At3g45140 


lipoxygenase AtLOX2 
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Table 1 (continued) 



line 


fragment number 


mips code 


annotation 


F03218 


4U 


At2g1 6080 


putative protein 


F03218 


4L 


At3g 16640 


Transiationally controlled tumor protein-like protein (TCTP 
homologu) 


F03224 


2 


At5g56420 


putative protein 


F03224 


3 


At2g1 0940 


unknown protein 


F03232 




At5g47610 


putative protein 


F23 


L 


At5g49940 


nifU like protein 



(7) Reconfirmation of phenotypes and identification of gene functions 

[0072] pBIG03024, one of the plasmids prepared by cloning described above was a plasmid into which a full-length 
cDNA of approximately 2.4 kb cloned from the F03024 tine (shown in Fig. 2) exhibiting a pale green T1 plant body was 
inserted downstream of a promoter in a sense direction. Further, as shown in Fig. 3, dominant segregation with a ratio 
of 3:1 was demonstrated in a T2 plant of F03024 in terms of the character of pale green. Agrobacteria GV31 01 was 
transformed with pBlG03024. Thereafter, dipping, and selection and breeding of hygromycin-resistant lines were car- 
ried out in completely the same manner as the preparation of FOX plants. Then , pale green plants that were completely 
the same astheTI line of F03024 were observed in 1 0 of 21 lines (shown in Fig. 4). Based on this result, cDNA inserted 
into pBIG03024 was identified as a gene relating to green pigment production in plant. 

INDUSTRIAL APPLICABILITY 

[0073] The system of the present invention enables dominant mutants to be obtained in the same way as in a con- 
ventional activation tagging method. In addition, since the number of genes incorporated into a T-DNA is one or two, 
a gene with overexpression can easily be identified. Further, since the gene is already isolated as a cDNA, further 
isolation of a causative gene is not necessary and thus a phenotype of the causative gene can easily be reconfirmed 
and identified using a genomic DNA isolated from a mutant plant. Furthermore, according to the system of the present 
invention, as long as a full-length cDNA is prepared, the origin of the cDNA is not limited to a specific species of an 
organism, thereby enabling gene function analysis of various organisms. 

Sequence list free text 

[0074] 

SEQ ID NO: 1 Synthetic DNA 
SEQ ID NO: 2 Synthetic DNA 
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SEQUENCE LISTING 

<H0> RIKEN 

<120> A SYNTHETIC GENE FUNCTION-ANALYZING SYSTEM USING 
FULL-LENGTH cDNA 

<130> RJH14-0HT 

<150> JP 2001-264156 
<151> 2001-8-31 

<160> 2 

<170> Patent In Ver. 2.0 

<210> 1 
<2I1> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 1 

acattctaca actacatcta gagg 

<2I0> 2 
<2N> 16 
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<2l2> DNA 

<2l3> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 2 

cggccgcccc ggggat 16 



Claims 

1 . A method for analyzing a gene function comprising the steps of: 

(a) infecting a plant group with a full-length cDNA library containing an expression regulatory sequence to 
introduce the cDNA into the plants; 

(b) selecting a plant complying with desired selection requirements from the plant group having the cDNA 
introduced thereinto; 

(c) isolating the cDNA from the selected plant; and 

(d) re-introduclng the isolated cDNA into a plant of the same species as the plant into which the cDNA has 
been introduced to reconfirm a phenotype according to the selection requirements. 

2. The method according to claim 1, wherein the expression regulatory sequence is a constitutive expression pro- 
moter, an inducible promoter, or a combination thereof. 

3. The method according to claim 1 , wherein the full-length cDNA library is introduced into Agrobacterium. 

4. The method according to claim 1 , wherein the cDNA is introduced into the plant by infection with Agrobacterium, 

5. The method according to claim 1 , wherein the selection requirements are based on stress resistance and/or mor- 
phogenetlc mutation. 

6. The method according to claim 5, wherein the stress is at least one form of stress selected from the group consisting 
of oligotrophy stress, drought stress, temperature stress, strong light stress, ultraviolet stress, salt stress, air pol- 
lution stress, agricultural chemical stress, oxidization stress, heavy metal stress, disease damage stress, and 
hormone stress. 

7. A system for analyzing a gene function comprising: 

(a) means for infecting a plant group with a full-length cDNA library containing an expression regulatory se- 
quence to introduce the cDNA into the plants; 

(b) means for selecting a plant complying with desired selection requirements from the plant group having the 
cDNA introduced thereinto; 

(c) means for isolating the cDNA from the selected plant; and 

(d) means for re-introducing the isolated cDNA into a plant of the same species as the plant into which the 
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cDNA has been introduced to reconfirm a phenotype according to the selection requirements. 

8. The system according to claim 7, wherein the expression regulatory sequence is a constitutive expression pro- 
moter, an inducible promoter, or a combination thereof. 

9. The system according to claim 7, wherein the full-length cDNA library is introduced into Agrobacterium. 

10. The system according to claim 7, wherein the cDNA fs introduced into the plant by infection with Agrobacterium. 

11. The system according to claim 7 t wherein the selection requirements are based on stress resistance and/or mor- 
phogenetic mutation. 

12. The system according to claim 11 , wherein the stress is at least one form of stress selected from the group con- 
sisting of oligotrophic stress, drought stress, temperature stress, strong light stress, ultraviolet stress, salt stress, 
air pollution stress, agricultural chemical stress, oxidization stress, heavy metal stress, disease damage stress, 
and hormone stress. 

13. A plant containing a gene with a function that has been analyzed by an analytical method according to any of 
claims 1 to 6. 

14. The plant according to claim 13, wherein the plant is a member selected from the group consisting of a plant body, 
a seed, a callus, and a protoplast. 
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Fig. 3 
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Fig. 4 
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